An Opinion Analysis Tool for Colloquial and Standard Arabic

نویسندگان

  • Mohammed Al-Kabi
  • Amal Gigieh
  • Izzat Alsmadi
  • Mohamad Haidar
چکیده

Social networks and users’ interactions are distinct features for the current Web. They constitute a fundamental part of Web 2.0, where people produce, disseminate, and consume information in new interactive forms where users are not only passive information receivers. Social media succeed to attract a large portion of online users, which explains the explosive growth of social media in terms of comments, reviews, blogs, microblogs, Twitters, and postings in social network sites. In this scope, sentiment analysis research field refers to the analysis of people’s sentiments, opinions, attitudes, and emotions towards events, products, companies, individuals, issues, sport teams ...etc. Facebook, and YouTube are within the top 3 sites used in many Middle Eastern (ME) countries, and the world. Therefore a huge volume of Arabic comments and reviews are generated daily about different aspects of life in this part of the world. Modern Standard Arabic (MSA) is used mainly in media (Newspapers, Journals, TV and Radio), academic institution, and to some extent in social media. While colloquial Arabic is used by the public in their conversations, chatting, etc.. Analysis of social networks in ME countries shows that both MSA and colloquial or slang languages are used. The aim of this study is to build a novel sentiment analysis tool called colloquial Non-Standard Arabic Modern Standard Arabic-Sentiment Analysis Tool (CNSAMSA-SAT) dedicated to both colloquial Arabic and MSA. A large number of Arabic collected comments and reviews from social media were tokenized and analyzed to build polarity lexicons which constitute an essential part of CNSA-MSA-SAT. Each Arabic collected comment and review is manually assigned to one of the three polarity values: (positive, negative, and neutral). Further, each collected review or comment is added to CNSA-MSA-SAT and is then assigned to one of the three polarities values based on algorithms developed for this purpose.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Opinion Mining and Analysis for Arabic Language

Social media constitutes a major component of Web 2.0 and includes social networks, blogs, forum discussions, micro-blogs, etc. Users of social media generate a huge volume of reviews and comments on a daily basis. These reviews and comments reflect the opinions of users about different issues, such as: products, news, entertainments, or sports. Therefore different establishments may need to an...

متن کامل

Idioms-Proverbs Lexicon for Modern Standard Arabic and Colloquial Sentiment Analysis

Although, the fair amount of works in sentiment analysis (SA) and opinion mining (OM) systems in the last decade and with respect to the performance of these systems, but it still not desired performance, especially for morphologically-Rich Language (MRL) such as Arabic, due to the complexities and challenges exist in the nature of the languages itself. One of these challenges is the detection ...

متن کامل

A Hybrid Approach for Converting Written Egyptian Colloquial Dialect into Diacritized Arabic

Recently the rate of written colloquial text has increased dramatically. It is being used as a medium of expressing ideas especially across the WWW, usually in the form of blogs and partially colloquial articles. Most of these written colloquial has been in the Egyptian colloquial dialect, which is considered the most widely dialect understood and used throughout the Arab world. Modern Standard...

متن کامل

Transforming Standard Arabic to Colloquial Arabic

We present a method for generating Colloquial Egyptian Arabic (CEA) from morphologically disambiguated Modern Standard Arabic (MSA). When used in POS tagging, this process improves the accuracy from 73.24% to 86.84% on unseen CEA text, and reduces the percentage of out-ofvocabulary words from 28.98% to 16.66%. The process holds promise for any NLP task targeting the dialectal varieties of Arabi...

متن کامل

Sentiment Analysis For Modern Standard Arabic And Colloquial

The rise of social media such as blogs and social networks has fueled interest in sentiment analysis. With the proliferation of reviews, ratings, recommendations and other forms of online expression, online opinion has turned into a kind of virtual currency for businesses looking to market their products, identify new opportunities and manage their reputations, therefore many are now looking to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013